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Integrated volatility and round-off error 
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We consider a microstructure model for a financial asset, allowing for price discreteness and 
for a diffusive behavior at large sampling scale. This model, introduced by Delattre and Jacod, 
consists in the observation at the high frequency n, with round-off error a„, of a diffusion 
on a finite interval. We give from this sample estimators for different forms of the integrated 
volatility of the asset. Our method is based on variational properties of the process associated 
with wavelet techniques. We prove that the accuracy of our estimation procedures is a„ Vn~^^^. 
Using compensated estimators, limit theorems are obtained. 

Keywords: diffusion models; high frequency data; integrated volatility; microstructure noise; 
round-off error; variation methods; wavelets 

1. Introduction 

Nowadays, a massive amount of high frequency financial data is available. This large 
quantity of data has paradoxically complicated some problems in statistical finance. 
Among them, one of the most relevant is the estimation of the integrated volatility of an 
asset. To fix ideas, let us consider an asset whose theoretical, efficient price {Xt)t£[o,i] 
follows an Ito process of the form 

dXt = fit dt + atdWt, 

where Wt is a Brownian motion, the drift process and the instantaneous volatifity. 
From market price observations, we wish to estimate the quantity 

where g is a known deterministic fmiction. The case g{x) — 1 corresponds to the abso- 
lute integrated volatility of the asset and the case g{x) = l/x to its relative integrated 
volatility, that is, the integral of the squared diffusion coeSicient of the logarithmic price. ^ 



This is an electronic reprint of the original article published by the ISI/BS in Bernoulli, 
2009, Vol. 15, No. 3, 687-720. This reprint differs from the original in pagination and 
typographic detail. 



^Note that the usual notion of integrated volatility refers to the relative integrated volatility. 
1350-7265 © 2009 ISI/BS 
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Assume first that we observe the efficient price data with frequency n, that is, the sample 

In this setting, a common convergent estimator of the integrated volatihty, with rate 
and feasible asymptotic theory, is given by the realized volatility, that is, for the 
absolute integrated volatility 

n 
i=l 

and for the relative integrated volatility 

n 

^(iog(x,/„) - log(X(,_l)/„))^ 

i=l 

see Jacod and Prottcr [16], BarndorfT-Nielscn and Shephard [7], Meddahi [22] and 
Gongalves and Meddahi [12]. 

However, it is a well-known fact that high frequency financial data do not behave like 
an Ito process. In the literature, this gap is often considered to be a "contamination" 
of the theoretical, efficient price and is called microstructure noise. This microstructure 
noise increases with the sampling frequency and is due to several reasons, one of the most 
obvious being price discreteness. 

To get rid of this noise, the first solution is to sample our data over a longer period. 
But, if we imagine we collect one unit of data per second and consider five minutes as the 
finest period we can tolerate to make the noise insignificant, we throw away a lot of data, 
which is hardly acceptable. Consequently, dealing with these high frequency noisy data 
has become a challenging issue. Many recent papers treat this problem, especially for the 
purpose of estimating the integrated volatility; see in particular Barndorff-Nielsen et al. 
[6]; Bandi and Russell [4]; Jacod et al. [15]; Zhang [30]; Zhang, Mykland and Ai't-Sahalia 
[31]; Hansen and Lunde [13]; Ai't-Sahalia, Mykland and Zhang [1]; and Glotcr and Jacod 
[11]. For a comparison between several estimators, see Andersen, BoUerslev and Meddahi 
[3]; Bandi, Russel and Yang [5]; and Gatheral and Oomen [23]. 

In most of these works, one observes at some deterministic times i", z = 0, . . . ,n, a 
log-price Yt^ composed of a theoretical, efhcient log-price Xt^ , coming from the classical 
continuous-time mathematical finance theory, contaminated by an additive microstruc- 
ture noise e^ru , that is, 

where Xt is, for example, an Ito process. In these additive microstructure noise models, 
the developed technologies often aim at reducing the impact of the noise. 

Nevertheless, although price discreteness is largely accepted as one of the main reasons 
for microstructure noise, these models rarely allow for it; see Large [19] and Robert and 
Rosenbaum [25, 26] for models considering discrete prices. In this paper, we study the 
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problem of estimating the integrated volatility of an asset when assuming that the efficient 
price data are observed with round-off error. 

2. Model and results 
2.1. Description of the model 

We consider the model of a diffusion observed with round-off error. Let a„ be a pos- 
itive decreasing sequence tending to zero as n goes to infinity and /3„ = an\/n- On 
a filtered probability space (O, (^t)tg[o_i],P), we consider a one-dimensional Brownian 
semi-martingale {Xt)t^[o,i], taking values in an open interval {vjfJ-), —oo <v<fJ-< +oo, 
of the form 

Xt^xo+ f <j(X,)dW,+ f Osds, (1) 
Jo Jo 

where (W^t)te[o.i] is a (J-t)-standard Brownian motion, {at)t(=[o.i] is a progressively mea- 
surable process with respect to (^t)tg[o,i]j x — > <j{x) is a real deterministic function that 
is not known and xq is a real constant. We observe the sample 

(4"„"\z = 0,...,n), (2) 

where 

Thus. X^"^"^ is the observation of with round-off error «„. This model has already 
been studied by Delattre and Jacod [9] when tends to a constant finite value and 
by Delattre [8] in the other cases. Based on the sample (2). our goal is to estimate the 
random parameter 

9= [ g{Xs)^(j{Xs)^ds, 

where g is a known deterministic function on (j^, /i). 

Note that for the Black-Scholes specification of the model 

cr{x) = ax, 

the problem of the estimation of the constant parameter tr has been partially treated by 
Li and Mykland [20] in the case where tends to zero. 

We denote by C'^(/) the set of k times continuously differentiable functions on / C M. 
We write C^(/) if all the derivatives are bounded. We will consider the following assump- 
tions: 



690 



M. Rosenbaum 



Assumption A. 

supQ;„(logn)^ < oo. 

n>0 



Assumption Al. There exists p > such that sup„>qQ;, 



4 ''(logn)^ < oo. 



Assumption B. 

(i) For all x G {i^,fJ-), cr{x) > 0, 

(ii) x^a{x)eC^{iiy,ti)), 



(iii) flg ds < +00, almost surely. 

Assumption C. 

X g{x) e C^((i/, fij) and for all x e {v, y), g{x) > 0. 

Assumption CI. 

(i) X g{x) e C^{{i^,y)) and for all x e (z^, g{x) > 0, 

(ii) x-^ g'{x) is of constant sign on {v,y) and a; — > |g'(a;)|^/^ eC^((j/,^)). 

By convention, if a; ^ (z^, ju), we set g{x) = = 0. 

Note that on (0, +c»), the functions defined by x 1 and x ^\/x satisfy Assumption 
CI. These functions are those respectively associated to the absolute integrated volatility 
and to the relative integrated volatility. 

2.2. First estimator 

Our estimation method is based on the theory of wavelet methods for quadratic func- 
tionals estimation; sec, for example, Gayraud and Tribouley [10]. Throughout the paper, 
for fc G N and j £ N, we set 



Ijfc(s) l(fe/2^fc+l/(2J)](s), -0(3) =-l[0,l/2](s) +1(1/2,1] (s). 

We define the coefficients Cj^k, jo & k e [0,2^° - 1] and d^^, j e N, fc £ [0, 2-'' - 1] by 



Hence, the Cj^k and djk arc the coefficients of s g{Xs)<j{Xs) in the Haar basis. Conse- 
quently, we have 





2^0-1 



+00 2^ -1 
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We set 

Thus, in the case g{x) ~ 1, Cj^k can be seen as a rescaled local average of the increments 
of the rounded diffusion over a window of size 2~^° . We define our first estimator 0„ of 
9 by 

2^0." -1 
fe=0 

with jo,n = Llog2(a;^^ A y/n)\ . 

2.3. Convergence in probability 

We set r„ — an V n^^/^. We have the following theorem: 

Theorem 1 (Convergence in probability). In model (l)-(2), under Assumptions A, 
B and C, the sequence 




is tight. 

2.4. Compensated estimators 

It seems difficult to obtain a central limit theorem for the previous estimator (see the 
proofs for details). Consequently, we introduce compensated estimators. We set 

2^-1 
fe=0 

and define 

We denote by S the set of all triples (a, (ji^n), (j2,n)), where a is a real number with 
< a < 1 and (ji,„), (j2,n) are two sequences of integers such that 

supa^-"(logn)2 < oo, r„2^^^-"-^'-" ^ 0, r-^2^'-^^{allogn + 1/n) ^ 0, 

n 
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[log2(r„^/^)J and j2,« = Llog2('"n^^^)J , then {p, (ji,„), (j2,n)) G S. For S = (a, (ji,„), (j2,n)) e 



Under Assumption Al, the set S is not empty. For example, if one takes ji^„ = 
[log2(r,T^ 
S, we set 

and consider 



A: 



L(l+a)log2 r-ij 

For S* = (a, (ji,n), (.72, n)) G 5, our final estimator of 9 is 

2J1,™_1 2^0'"-! 



2 

A;=0 k=0 



where 



and lg'>o indicates whether x g' (x) is non-negative or not. 
2.5. Convergence in law 

We state in this section some limit theorems. In this context, it is convenient to use the 
notion of stable convergence in law; see Renyi [24], Aldous and Eagleson [2], Jacod and 
Shiryaev [17] and Jacod [14]. 

Definition 1 (Stable convergence in law). A sequence of variable (Ar„)„gN converges 
stably in law to a variable X {X^ —^cs X) if X is defined on an appropriate extension 
(jl,JF,P) of (ri,JF, P) and if for any ^-measurable bounded variable Y and any bounded 
continuous function g, E,[Yg{Xn)] ^E[Yg{X)]. 

For (3 > 0, we define the function by 

A/3(x) =limE 

n 

with 



E^. 



, j=i 



= /3(7t/2)1/2| L{[/ + /3-^aix)W,.i} + ^ V(x)(W^, - W,^i)\ \ - a{x), 
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where is a Brownian motion and U a uniform random variable on [0, 1], independent of 
W. From Dclattrc [8], wc got that the function Ap is well defined. We have the following 
theorem: 

Theorem 2 (Convergence in law). In model (l)-(2), under Assumptions Al, B and 
CI, for S gS, we have the following stable convergences in law, where B is a standard 
Brownian motion, independent of T : 

tfPn^O, V^{0n{S)-e)^csV2{n-2)^^^ [ g{Xtfa{XtfdBt, 

Jo 

tff3n^P>0, V^{k{S)-e)^cs'2 [ g{Xtfa{Xt)[Ai3{Xt)Y'^dBu 

Jo 

tff3n^+^, a;,\L{S)-e)^cs^j^ g[Xtf<j{Xt)dBt. 

3. Discussion 

3.1. Comments on the results 

• Our microstructure model with round-off error is obviously built to face the problem of 
price discreteness. Indeed, market price increments have to be multiples of the tick size. 
Moreover, it is striking to see how high frequency financial data do look like diffusions 
with round-off error; see, for example, [28]. In particular, the well-known bid-ask bounce is 
reproduced in this model. Furthermore, if the sampling period becomes large, the round- 
off error becomes insignificant. According to the theory and the empirical studies, this is 
also the case on the markets where it is often admitted that low frequency financial data 
can be modeled as data coming from a diffusion process. Hence, this model is relevant 
because it is clearly linked with market observations and financial theory. 

• Our point of view is different from those of an additive microstructure noise. We do not 
make assumptions on the difference between the observed log-price and the theoretical 
log-price but on the observed price itself. Hence, our method is not a denoising method. 
We directly use the properties of the noisy data. Moreover, Li and Mykland [21] have 
proved that estimators built for additive noise, like the two scales estimator of Zhang, 
Mykland and Ai't-Sahalia [31], are not robust in the case of a "quite big" rounding error. 

• The estimation rates arc the same as those obtained by Delattre [8] for other procedures 
on this model. In particular, if the order of magnitude of the round-off error is smaller 
than we find the classical parametric rate. 

• More general forms of stochastic volatility seem difficult to treat with our wavelet 
technique. Indeed, our proof of the central limit theorem relies on the fact that under an 
equivalent measure, the process can be written as a function of a Brownian motion, which 
is not the case for general stochastic volatility models. Nevertheless, Theorem 1 remains 
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true in the case where the instantaneous volatility is of the form a{x,t) = gi{x)g2{t), 
with gi and g2 as two positive functions such that gi G C^(R) and g2 £ C-^([0, 1]); see [28]. 
• The integrated volatility cannot be recovered in a pure rounding framework where the 
sequence q;„ is supposed to be constant as n goes to infinity. Nevertheless, it can be done 
for some particular rounding procedures; see Jacod et al. [15]. 

3.2. Intuition for the results and important ideas 

To give some intuition for the results, introduce important ideas used in the proofs, and 
explain why methods based on the quadratic variation do not work here, we recall and 
explain an inspiring result of Delattre [8] when /3„ tends to infinity. 

3.2.1. The behavior of the p-variations 

Let h be the density of a standard Gaussian variable and 

7p(a,/3)= I'du [ dyh{y)\{f3u + ay)^^^\''. 







It is shown in Delattre [8] that if tends to infinity, that is, if the round-off error is 
quite big, for p > 0, we have 



n „i 



(X,),/?„)ds 

tends to zero in probability. The stable convergence in law of this sequence normalized 
by has also been proved in [8]. 

3.2.2. Remarks and explanations 
The point is to remark that if p = 1 , 

Plr' /\p(a(X,),/?„)ds=(2/7t)i/2 [\{Xs)ds 
Jo Jo 

and that if p > 0, 

Jo Jo 

tends to zero in probability. So, in the case /3„ tends to infinity, if p is not equal to one, 
contrary to what happens when considering non-noisy data, a{Xt)^ does not appear in 
the limit of the sum of the rcscaled rounded increments to the power p. Thus, estimating 



/ a\Xs)ds, 
Jo 
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seems more complicated than estimating 

/ <j(Xs)ds. 
Jo 

We now give an intuition for this surprising behavior of the p-variations through a non- 
rigorous argument. Introducing several important ideas, we explain why, when f3„ tends 
to infinity, 

E.(x,_,,„)[l4""' - 4-i)aJ'] ^ </3,;i(2/7t)VV(X(,_i)/„), 

where K^(^Xi^i_iy„) denotes the expectation conditional on cr(^(i_i)/n)- We define the 
fractional part of Xt by {Xt} = Xt — lXt\ . First we have to remark that 

^z'/'n^ ~ ^{i-l) In = V{X(i--\)lnl "n} + (-'^j/ri ^ -'^(r- !)/« ) /"nj • (3) 

Kosulajeff [18] and Tukey [29] have established that when a is small, {Xja) is almost 
independent of X and approximately follows a uniform law on [0, 1]. More precisely, the 
following result has been shown by Delattre and Jacod [9] . 



Lemma 1 (The fractional part of a variable). Let k be a function on R, C" (r > 1), 

integrable with integrable derivatives. Let f be a function on R x [0,1], C in the first 
variable and such that for < I <r. Mi = sup^ \ -^f{x, u)| du < +oo. Then 



k{x) 



f{x,{x/a}) - / f{x,u)du 





dx 



< (2a)'" sup Ml sup / 

0</<r 0</<rjR 



Ql 

dx' 



dx. 



Thus, since 

Xi/n - X(^^_iyn ~ "'(-'^(i-l)/™) (W^i/™ ~ W^(i-l)/n), 

we have 

where [/ is a uniform variable on [0, 1], independent of X, and y is a standard Gaussian 
variable, independent of X and U. Therefore, if /3„ tends to infinity. 

We conclude our argument using the simple but nice fact that if C/ is a uniform variable 
on [0, 1] and Z is independent of [/, with a density with respect to the Lebcsgue measure, 

E[|LC/ + ZJ|]=E[|Z|]. 
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4. Proofs 

In all the proofs wc use the previously defined notation. For technical reasons, we suppose 
without loss of generality that for given j, n2~^ is a positive integer. In the following, c 
and Cp denote constants not depending on n, j or k and that may vary from line to line. 

4.1. Preliminaries for the proofs of Theorems 1 and 2 

4- 1.1. Localization procedure 

We slightly adapt here a classical localization procedure used, for example, in Delattre 
[8]. It will enable us to replace Assumptions B, C and CI with much stronger assumptions 
in the proofs of Theorems 1 and 2. We fix two sequences (t'q)g>i and {iJ,q)q>i such that 
(z^g) is strictly decreasing to i/, {fiq) is strictly increasing to /z and {fig — Vq) > 0. We also 
fix a sequence of fimctions Xg : K ^ [0, 1] such that Xq & C^{^), {XqY^^ & C^i^) and 

Xqix)^l on [vq,^J,q] and Xq{x)=0 on (-oo,t^g+i] U [/.ig+i,+oo). 

For qGN, we define on R the real functions aq and gq by 

aq{x)=a{x)xq{x) + il-Xq{x)), gq{x)^g{uq)+ j g'{s)xq{s)ds. 

We finally set 

Tq = mfl^te [0,1], Xt<iyq+aq OTXt>^iq Or j al(ls>q^M. 

Under Assumption B, Tq tends almost surely to 1 and P(Tg = 1) ^ 1 &s +oo. Let 
(W^,t > 0) be defined by = I^(T,+t)Ai - Wr, and (rt')t>o be the solution of 

dY,''^aqiY,'^)dWl YS = Xt,. 

Consider now the process (X(')tg[o ij defined by Xf = Xt for t E [0,Tq] and X^ = Yt-^T 
for t £ {Tq, 1]. This process satisfies 

dX^ = aq{X^)dWt + a1 dt, 

where a' = at for t € [0,Tq] and a' = for i G {Tq, 1]. The process X'' coincides with the 
initial process X on [0,rg]. Let go = infj^iMq > ^q + For q > go, on [0,rg], gq{X^) 
coincides with g{Xt) and for n> q, (/^(X'^""-') coincides with Finally, under 

Assumption CI, (gq)' is of constant sign on R, |(5g)T^^ G and on [0,Tg], {gqY{X^) 

coincides with g'{Xt). Furthermore, for n>q, (gg)'(X('^""'*) coincides with g'{xj:°'"''). 

Hence it is enough to prove Theorems 1 and 2 for the processes X'^, for all q>qo, and 
so it is enough to prove Theorem 1 under Assumptions B' and C instead of Assumptions 
B and C and Theorem 2 under Assumptions B' and CI' instead of Assumptions B and 
CI, with Assumptions B', C and CI' defined the following way: 
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Assumption B' . 

(i) There exists c > such that for all a; € K, cr{x) > c, 

(ii) x^a{x) eC^{R), 

(iii) sup^gfj /g ds < +00. 

Assumption C . 

X g{x) e C^(R) and there exists c > sueh that for all x G K, g{x) > c. 

Assumption CI'. 

(i) a; ^ g(.T) G C^(R) and there exists c> such that for all x S M, g{x) > c, 

(ii) X g'ix) is of constant sign on R and x Ig'lx)]^^"^ € C^(R). 

4-1-2. Change of probability 

Under Assumption B', by the Girsanov theorem, we can construct a probability P' on 
(f2,^), absolutely continuous with respect to P and a Brownian motion under P', {Wl, t > 
0) such that 

dXt = a{Xt)dWi + ^a{Xt)-^a{Xt)dt- 

Assumption B' holds for this representation. We define the following supplementary 
hypothesis: 

Assumption D. 

The convergence in probability and the stable convergence in law being preserved by an 
absolutely continuous change of probability, it is consequently enough to prove Theorem 
1 under Assumptions A, B', C and D and Theorem 2 under Assumptions Al, B', CI' 
and D. Under Assumptions B' and D, Xt — h{Wt) with h:x ^ S~^{x + S{xn)) and 

r 1 
Jo cr(y) 

For simplicity, we suppose now that xq = 0- Note that X is a homogeneous Markov 
process with transition densities 

Pt{x,y)^a{y)-\2nt)-^/^cxp[-{2t)-\S{y)^S{x)f]- 

Moreover, the following inequalities hold; see, for example, Delattre and Jacod [9]. 

dy<ct-^'+^'>^^, i + j<2, (4) 



dx"^ dx^ 



Pt{x,y) 
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dx' 



■<lt{x,y) 



(5) 



with qt{x, y) = pt{x, x + y). Wc now give the proofs of Theorems 1 and 2. 
4.2. The behavior of the samphng functions 

We give in this section a key proposition for the proofs of Theorems 1 and 2. As in 
Delattre [8], we consider the following assumption: 



Assumption E. Let {x, u, y) — > fn{x, u, y) be a sequence of real functions on R x [0, 1] x 
R. The sequence /„ satisfies Assumption E if the functions /„ are twice continuously 
differentiable with respect to the first variable and if there exists 7 > such that for all 
n> 1, 

(i) \fn{x,u,y)\<jil + Pl)il + \y\-'), 

(ii) J^\Ux,u,y)\du<j{l + \y\-'), 

(iii) |£,/„(x,M,y)|<7(l+/32)(l + |yr),i = l,2, 

(iv) J^\£Tfn{x,u,y)\du<j{l + \yn, 1 = 1,2. 

Notation. For some sequences of real functions x s- gn{x) on R and (x,u,y) — > 
fnix, u, y) onRx [0, 1] x R, wc define 



2^/2 " 



l)/r; 



i=l 



and 



2^/2 



Let ha^ be the density of a centered Gaussian variable with variance . For a real function 
{x, u, y) f{x, u, ?/) on R X [0, 1] x R, we set 



mf{x,u)= / /v(x)(2/)/(a;,M,y)dy, Mf{x)= / mf{x,u)du. 
Jr Jo 

The following proposition is a general result on the behavior of the sampling functions. 



Proposition 1 (Behavior of the sampling functions). Let {x,u,y) fn{x,u,y) be a 
sequence of real functions on R x [0, 1] x R satisfying Assumption E. Under Assumptions 
A, B' and D, 



(v^" in, fn) - 2^/2 ^ 1^.^ {s)Mfn{X,) d.^ 



<cri 
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for < j < [log2 r-^\ and < fc < 2J - 1 . This holds f or < j < 1(1 + p) log^ r~^\ under 
Assumptions Al, B' and D. 

4.3. Proof of Proposition 1 

In this proof, we widely use the methods and resuhs developed by Delattre in [8] . We set 
p to zero if only Assumptions A, B' and D are satisfied and write Ejir^ for the conditional 
expectation with respect to Tt- 

4-3.1. Fundamental decomposition 

Notation. Let sjk = [2~^nk + 1, . . . , 2~^n{k + !)]• We use the following notation: 
u)^ qi/nix,y)fnix,u,y/ny)dy, A/„/„(x-) = / to„/„(x, m) dw, 



mrjn{x) = mnfn{x,{x/an}) ~ Mnf„{x), li fn{x) = j Pi/ n{x ,y)m.n fniv) . 

We set 

^r(/n)=/r-^/n/n(^(»-l)/n), 
riA(i+i-l) 



and 



23/2 " 



n 

i=l 



HjkifnJ) = 2^ [m„/„(Xj/„) - m„/„(X(i_i)/„)] 

~ 2^ ^"„-i)A(i-l)/n(-'^(i-l)/n)l2<(n-i)A(i-l), 
2^/2 (ri-i-l)A(i-2) 



Remark that for given n and z £ Sj^, 



2J/2 
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is a (J-"t)-niartingale in y. The following fundamental decomposition will be constantly 
used. 

Proposition 2 (Fundamental decomposition). 

JO 

+ V^Hn, Mfn) - 1^'^ C l,k{s)MU{Xs) ds - i7]U/n, - A7,(/„, I). 



Proof. We have 

57{fnJ) = V?{fn) - Af„/„(X,/„) + Af„/„(X(,_i)/„) - E^,,_„,„ [/n + E^,/„ [/r+i] 

riA(i+i-l) 

- E^,^_,,^„ b?r+i(/")] + E (E^./.. [<(/")] - iE^,_„/„ ['yrc/n)]). 

2=1 + 2 

Using that 

^y^./Jfi+i] = J qi/n{Xi/n,y)fr,{X^in,{Xi/n/an},Vny)dy, 

we get 

nA(i+i-l) 

2 = 1 + 2 

= iiifn) + m„/„(Xj/„) - m„/„(X(j_i)/„) - E^(._j,^„ ["T„/„(Xj/„)] 

nA(i+;-l) 

Ej^(;_ij^„ ['7lri/n(^(z-l)/ri)])- 

2:=i+2 



Since 



we obtain 



E;r^/,Jm„/„(X(^_i)/„)] = / P{z-i-i)/n{X^/n,y)'mnfn{y)dy, 



^iifn,l) = Vi if n) +rnnf nix i/n) - TO„/„ i)/„) 

(n-i)A(i-l) 

— /ri(-'f(i-l)/ri) + E [C-l/n(^i/") ~ ^"'^"("'^(»-l)/n) 
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Thus, 



Stifn) = Viifn) + mnfn{Xi/„) - m„/„ i)/„) 
~ ^(ri-i)A(i-l)/n(^(i-l)/")l2<(n-0A(i-l) 
(n-i-l)A(i-2) 



We finally get 



V^'^in, /„) - V^'^in, Mnf^) = ^ l,fe(i/n)7?r(/n) 
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4-3.2. Technical lemmas 



We prove here some useful lemmas. In particular, they will enable us to control the 
different terms of the decomposition. We begin with a usual Riemann approximation. 

Lemma 2 (Riemann approximation). Let f eC^ and 

2j72 



ri „i 

■VlJfc(^/n)/(X,/„)-2^■/2 / i^.^(s)/(x,)ds. 
.=1 -^0 



Then, 



Proof. Let 



We have 



2] <r r^o-ir,-'^ 



E[Ai]<c2 



J(i-1)/Ti 



|ai<2^/2/ |/(X,)-/(X,/„)|ds. 

■/(i-l)/ri 

Since / e C^, using the Burkholder-Davis-Gundy inequality, we get E[(^^)2] < c2^n~^ . 
Now, An = X]"=i with n2~-' terms in the sum. Thus, 

n n 

nAl] < 5^ 5^(E[(a)2]E[(a?])'/' < c2-^n-i. 

The following lemma is a consequence of Assumption E together with Lemma 1 and 
inequalities (4) and (5). Details can be found in Delattre [8]. 
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Lemma 3. 



/" 

^0 



\mnU{x,u)\<cil + f3l), 



■771„/„(X,U) 



dx' 



Mnfn{x) 



<c, 0<i<2, 



|A/„/„(x)-A//„(x)| <cn-i/2, 

\l'^fn{x)\<cal{l + n/i). 

We end this section with the foUowing bounds for and A'"^.. 

Lemma 4. 

|if;fe(/n,OI < cV'^n-^+cV'''al[\+n2-^{l-l)-' + (log n) 1^=2. 

Proof. From inequahties (6) and (9), we get 

01 < c2^'/'(l + Pi) + cV'^c^l n[{l - ly' + [n - 

We also have 

n 

z — 1 i^Sjk 



(6) 

(7) 

(8) 
(9) 



< c2J/2a^^(l + n/z) < c2^^^alnlog7 



□ 



4-3.3. End of the proof of Proposition 1 

Until the end of Section 4.4, we take I = n and omit this index in the notation. We 
now bound the different terms of the fundamental decomposition. By Lemma 4, since 
< j < L(l + P) log2 , we get 

\HUfn) + K^k{fn)\ < cV'\n-^ + al logn) < cr„. 

Inequality (7) together with Lemma 2 on Riemann approximation give 



E 



Inequality (8) gives 



V'\n,Mf^)-V/^ / tju{s)Mf,,{Xs) 
Jo 

|T/^'=(n,M„/„ - M/„)| < 2--'"/2n-i/2. 



<c2-%-i. 
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We now turn to the approximation term A^"j,(/„). We have 



From the results of Delattre [8], Chapters 7 and 8, we can show that 

EiSnff] < c{nal + (1 +/3,^)(1 + a,,{n/^y/')). 
Since X]r=i ^ "^^/n, we have 

E[M:}k{fnf] < cal + c(l + /32)(n-i + 2^"/2„^^„-i) 

< c{al + n-^ + 2^/2a„n-i + (10) 
Putting all the inequalities together, we obtain Proposition 1. 

4.4. Proof of Theorem 1 

Using the remark on the change of probability in Section 4.1.2. the following proposition 
implies Theorem 1. 

Proposition 3 {L^ convergence, absolute integrated volatility). Let 9n be the 

estimator defined in Section 2.2. Under Assumptions A, B', C and D, 

E[\en - e\] < crn, 

with c a constant not depending on n. 

For expository purposes, we first treat the case g{x) = 1. 

4-.4-1- Proof of Proposition 3 in the case g{x) = 1 
We assume here that g{x) = 1 and set 

/„(x,u,y) = (7T/2)i/2/3„|Lu + /3-ij;J|. 

This specification implies 

M/„(X,,) = a(X,). 

We begin with a lemma on the behavior of the wavelet coefficients. Let Cj^k, djk and Cj^k 
be as defined in Section 2.2. Thanks to the vanishing moment of we easily get the 
following result: 
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2^-1 



2^-1 



'-jk- 



k=0 



fe=0 



We have the fohowing lemma: 

Lemma 6. Let 0< j < L(l + p) log2 '",7 , then 

nZj\ + \Z,\]<cr^. 

Proof. We have Zj = Zj^i + Zj^2 with 

r{k+l)/2^ 



2" - 1 / r(k+l)/2-' X 



?7. 

fe=0 iSSjfc 

We easily get E[|Zj_i|] < cr„. For Zj^2, we have 

2^-12^-1 

n\z,£] = ^Yl E E E E[a(x,2-.)^(x,,2-.)<^.<5^]. 

k=0 fe'=0 iGSjfc j'eSjj./ 

For i 7^ i', conditioning by ^max , we get 

E[a(X,2-.)a(X,.,2-.)M-] = 0. 

Therefore, 

2^-1 



K[i%2n = ;J,Ei^ 



k=0 
2^-1 



E^? 



fe=0 



For Zj, recall that 



9J72 
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n—i — l 



and that 

;^/„ iX,/„ ) E^^/„ [m„/„ (X(,+^)/„ )] . 
The same method gives the result. 



□ 



We now end the proof of Proposition 3. From Proposition 1 and equation (3), we can 
write 



and 



We have 



E[\en-e\]<ci 



E[|e,o,„fc-c,o.„fc|']<cr^ 



J = Jo, ,1 + 1 fc 



+ c(r„ + 2^«-r2), 



where TZ^ is equal to 

M„/„ - M/„) + /„) - 2^«."/2 [ „fe(s)M/„(X,) ds - ff,^,„fe(/„). 

By Lemma 5, we have 



E 



E E4. 

.j=j0,n k 



Moreover, using the preceding computations, it is easy to see that 



E 



< c(2J«-"n"^ + 2^0'" + n"^/^ + al logn). 



The result follows. 
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4-4-^- Proof of Proposition 3 in the general case 
Let 

We easily get the result in the same way as in the previous proof, remarking that 

r^2^°-"/'^ ^ (a ) (a ) 

|cjo,.fc - c*„_„fc| < can J -^^2^1jo,„fc(i/n)|X"„" -^(»-"i)/„l- 

Consequently, using Proposition 1, we obtain 

n\cjo..k - c,o.„fcn < cE[|c,„.„fc - c;_„,n + cE[|e*o,„. - c,„,„fep] < crl 

and 

k k 

with E[|^|] < can- The result follows. 
4.5. Proof of Theorem 2 

In this proof, Assumptions Al, B' and D are in force for a„ and X . We also assume until 
the end of Section 4.5.2 that g{x) — 1. 

4-. 5.1. Compensator 
We have 

^Jofc - / cr{Xsf ds = ^(cjofc - Cj„kf + 2 ^ Cj^k{cj^k - Cj^k) - 4fe- 

k k k j>jo k 

The central limit theorems will be derived from the double product term. If, as previously, 
we choose jo such that 2^° is of order r~^, re-normalized by r~^, the two other terms 
do not tend to zero. Hence, we can either choose 2^ > r~^ and compensate the first 
term or choose 2-' < r~^ and compensate the last term. The first method is classical in 
quadratic functionals estimation. However, it seems difficult here. Indeed, a compensator 
of Sfe(cjfc ~ Cjfe)^ requires an accurate enough estimation of the function x — > a(x). 
Consequently, we compensate the last term. This is unusual, but possible in our specific 
setting. Of course, a one-by-one estimation of the coefficients djk is probably not suitable 
for building the compensator. This is simply because the error between the coefficient 
(iji^ and its estimation is of the same order as the error between the coefficient c^j, 
and its estimate. That is why we use here the following scaling property of the wavelet 
coefficients. 
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Lemma 7. Let 



Qj =V4fc, G(it)= / V(u)du and c(V') = / G^{u)du. 
I. Jo Jo 



We have 



E 



2JQ, -c(Vj) / h'iWu^du 



Proof. We briefly give the main steps of the proof. Details can be found in Rosenbaum 
[27]. First we define 

d'jk= J ^okWtdt. 

We easily get that the d'^f. are independent centered Gaussian variables such that 

Let ^ : [0, 1] ^ R be a deterministic bounded function, vanishing outside the interval 
[fc2-J°,A:'2-^n] C [0,1] and define 



k2-o ■ 



fe=0 



One can show that 



E[Sj(0 ] < csup(e( )|fc' - fc|2^-^°. 



Using a decomposition of the function h'"^ in a wavelet basis, these results enable us to 
prove that 



(11) 



Since "0 has a vanishing moment. 



^,kit)hiWt)dt^h'iWk2-.) / ^jk{t)Wtdt 



and so 



d%^h'iWk2-,fd'% 



We conclude using equation (11) together with a Riemann-type approximation. □ 

The following lemma shows that our method enables us to estimate the remaining 
coefficients accurately enough. 
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Lemma 8. Let S = [a, (ji,n), (.72, n)] G S. Then, 



k j>jl,n k 



- fc 

Proof. We want to compensate 

L(l+a)log2 r-ij 

Wc know that for big enough j and j2,n < J, is close to 2^'^-'^~^Qj^ ^. Therefore, we 
estimate the preceding quantity by 



with 



for appropriate ji^n and j2,ri- Let 

L(l+a)log2r-iJ L(l+i)log2r-iJ 



E Q^- E 2^^'""'^^- 



and r = c(V') /o^ dw. We have 

L(l+a)log2 r-ij 



0=31,: 



Using the same arguments as for Proposition 1, for ji^„ < J < [iX + log2 ''n , we get 
Hence, we also obtain 

n\djk\\djk~djk\]<c2-^rn. 

Consequently, we have 
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As a > 0, it is clear that 

i>L(l+a)log2r-ij 

4-5.2. Limit theorems 

We prove in this section Theorem 2 in the case g{x) = 1. Let 
/„(x,«,y) = (7t/2)i/2/3„|[u + /3-i2/J|, 

and z" = — I(i^i)/n ^^fni^s)- We begin with some intermediary lemmas. 
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□ 



Lemma 9. Let 



2>./^ E 



+r-i2J"/2(n^i logn) + 2J"r„^0, t/ien r;:iri(j„, n) ^ 0. 

Proof. We write Mjk{ln) for MjkifnJn), Ti for Ti{jn,n) and j for j„. We use the 
decomposition Ti = Tu + T12 with 



Mjk{ln)- 



We easily get 



E[|Ti2|] < c2-^'/2r„ + c2J'/2(^-i ^ ^f, logn + aln2-^ /k 



We take Z„ = [n/lognj and therefore r„-^E[|ri2|] tends to zero. We set F{x) ~ <T[h{x) 
The term Tn can be written as 



V / l,k{^){Ws-Wk2-.)F'{Wk2-.) 



5.{ln) 



ds 



2^/2^i?fcX,fe(/„), 



with E[|i?fcp] < 2 . Following Delattre [8], Chapters 7 and 8, there exists 5i{ln) such 
that for In = [n/lognj 

n\5Sn)~Uln)?]<cnrl/\ogn, 

n5^{lnf] < c(l + + anln/iY'-"). 
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Consequently, we obtain that the expectation of the second term of Tn is less than 
2~''^^r„. The first term can be written as Ai + A2 + A3 + + with 



k i£Sjk 



^4 = E E ^'Rfc + 1)2"' - ^|n\{W,,n ~ W^(.-i)/n)i^'(W^fe2-.)^ds, 

k iesjk 



k iGSjfc 

We easily get that E[A^ < c2~^rl. For A2, we have 

9i/2 / X 1/2 

IEOA2I] <c (sup(E[*,a„)2])) <c2-'/2(l/n + a2). 



We now turn to A3. We write here Si for 6i{ln)- We easily obtain that Ei^a] is equal to 



(fc+l)2"J' /.(fc+l)2"J' 



i' / r. 



dsds'. 



We consider the quantity 



A A 



Suppose that i > i' and s' > i/n. Then 



n n ' 
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FinaUy, 



Hence, 













ds ds 


Ji/n Ji/n 






ml] < ^:2-^^^ 







For A4, consider the function ^ defined on [0, 1] by ({t) = l — t and Cjk{t) = 2^^^({2^x — k). 
We have 



fc iSSjfc 

with E[|i?|] <c(l/n + af/^). Using that the function /„ verifies in our case 

|/„(.T,u,y)|<c(l + /3„)(l + |y|), 
following Delattrc [8], Chapter 6, we can show that the quantity 

Vn{Xi/„ - X(j_i)/„)(7(X(j_i)/„) ^6i{ln){l + Pn)~^ 

can be written as 5n(-'^(i-i)/n, {-'^(i-i)/™""}, Vn[Xi/n - The function g„ sat- 

isfies Assumption E. Therefore, since Mgn{x) = 0, using the same arguments as in the 
proof of Proposition 1, we can prove that 



- E Qkii/n)gniX(^i_iy,-„{X(^, 

— 1)/ n*-^n },Vn[Xt/n - Xj^i_iyn]) 



<cri 



Consequently, 



E^'(^'^2-.) E 2-^'/20fc(»/n)(x,/„-X(,_i)/„)^d,' 

fc iesjfc 
For the first term, we use that 

n 

-Y,nMln)~Uln)\'']<crll\ogn 
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and finally 

k iesjk 

< cr„(logn)"^/2. □ 
Lemma 10. Let 

Proof. We write for ^2(?^) and set = (2/7t)i/2. We have 

T2 = C„ E / ['^(^■^) - ds. 

We can write 



l»-l)/«)]^ds 



, J(»-l)/n 

/ [<^iXs) - CT(X(i_i)/„)]CT(X(j_i)/„) ds. 

,- J(i-l)/n 



Ito's formula gives 

ri/n 



T2 = c„E/ ds(T(X(,_i)/„) / a'(Xt)a(XOdm 

J(i-l)/n J(j-1)/ti 



/ dsa(X(,_i)/„) / -a\Xtfa{Xt) + -a(Xt) dt + 

, JU-l)ln J(i-l)ln\'^ ^ J 



r-ijn 

/(2-l)/n -'(^-l)/ 

with E[|i?|] < c/n. Finally, we obtain 



p-%jn p-s 

T2 = R + c„y2 ds(7(X(,_i)/„) / a'(Xt)a(Xt)dm+i?', 

, J(*-l)/n J(i-l)/n 

with E[|E' I] <c/n. Let 



i/n ps 

dsa(X(,_i)/„) / a'{XtMXt)dWt. 

{i-l)/n J{i-l)/n 
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For i' < i, Ejp-.,^^ [yyi] = 0. Hence, for given n, = ^ martingale. Conse- 

quently, 

1=1 

Since 



n 



{t-l]/n 



ds a(X(,_i)/„) / a'{Xt)a{Xt)dWt 



{t-l)/n 



we get 



Lemma 11. Let 



E[{Mn'] < c/r 



+r-i2J"/2(n-i logn) + 2J"r„-^0, then r~^T3{j„,n) ^ 0. 
Proof. We write for T3{jn,n) and j for j„. We have 



+ r, + R,], 



with 



□ 



n = -[m„/„(Xi/„) - m„/„(X(j_i)/„)] 



^ n — i—l ^ 

- [IzfniXi/n) -lzfniX(i-i)/n)] ^^_i/n (-'^(i-l)/ri )li<n-2 



and < cn We easily get 



E 



^i{ln) 



k i£Sj 



2n 



The second term of the decomposition can be written as 
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= Bi + B2 + 

with 

-Bl = X] i'^i^t/n) ~ <^iX{t-l)/n)]n, 

k iesjk 

i?2=^[a(X(,+i)2-.)-a(X,2-.)] 

k i&Sjk 

k iSSjk 

Using Lemma 4. wc obtain 

E[\B2\]<V/''(-+al\og 
For Bi we consider the decomposition Bi = Bn — B12 with 

i 

B12 = E['^(^V") - {^-^ + i?. 

Using the same method as for A^, we get E[|_Bi2|] < cr„ (log 77,)"^/^. We have for the other 
term 

i 

\- WW g(^(»-l)/„) 

+ i? 

with E[|i?|] < c/n. Therefore, we easily get that E[|i3ii|] < cr„n^^^^. We now treat B3. 
The quantity 

E E ['^i-^{k+l)2-i) - <^iXi/n)]-['>T^nfn{X^/n) -rnnfniX^i_iy„)] 
k iGSjk 



can be written as 



E E i'^i^ik+l)2-i) ~ '^{Xi/n)]-mnfn{X,/„) 
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X! [0'(-'^(fc+l)2-j) - Cr(Xj_i/„)]-77T„/„(X(j_i)/„) 
k i&Sjk 

k iGSjfc 

This is equal to 

-^[<^{^(k+l)2-i) ~ (^{Xk2-i)\-'mnfn{Xk2-i) 
k 
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k iSSjfc 



with E[|i?|] < cr^. Eventually this term is equal to 

+ ^ ^ {Xi/n ~ X(^i_iyn)<^' {X(^^_iyn)-'rnnfn{Xi^i_iyn) + 

I. .V- ^ 



k iesjfc 



with E[|i?'|] < c2^/'^rl. The quantity 

— 1) /n)'^n fn{X(i^l)/n){y +/3n) 

can be written as gn(^(i-i)/ra) {^(i-i)/nQ:n}) \/n[Xiin — X(^i_iy„\). This function satisfies 
Assumption E. Hence, since Mgn{x) =0, we obtain 



We now treat 

(n-i-l)A(;„-2) 



_^ (n-i-l)A(;„-2) 



This can be written as 



X! ['^(^(*;+i)2-3) - CT(Xj_i/„)]-;"/„(X(i_i)/„) 
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+ X! Wi^i/n) - (^{X(i^l)/n)]-lzfniX(i-i)/n). 

k z i^Sjk 

Hence, it is equal to 

k z 

+ X] i^i/n-X{i-l)/n)o''iX(^i_iy„)—lzfniXl^i^i)/„)+R, 

k z i&Sjk 

with < ca^logn. This is finaUy equal to 

z k i€Sjk 

with E[|i?'|] < c2^^'^allogn. The quantity 

can be written as 5„(X(i_i)/„, y^[Xi/„ — X(^i_iy„]). This function satisfies Assumption 
E. Hence, since Mgn{x) =0, 



E 



Eventually, 



E 



<c7i ^a^j^l + n/z). 



<ca^,(l + logn). 



z k iesjk 

It is also clear that 



E 



[<^{^(k+l)2-i) - <^{^^/n)]-ln-ifn{X{^-l)/n) 



< ca^j(l +logn). 



□ 



We finally have the following result: 
Lemma 12. Let 

'^U ^J2^3,^k{Cj„k-Cj^k)- 

k 

logn) + 2^"r„ —f 0, then we have the following stable conver- 
gences in law, where B is a standard Brownian motion, independent of T : 



if Pn 



1 

v2 Jo 
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lfPn^l3>0, 
if Pn +00, 

Proof. We have 
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Jo 



E 



2^"/' / l,„fe(s)^(X,)d.s 



^ (jn , n) + r2 + T3 ( jn , n) + T4 (n) , 



with 



r4(n) = ^<7(X(,_i)/„)gr- y <7(X,)Af/„(X,)ds. 

i 

We get the result by applying the results of Delattre [8], Chapter 2, to the term T4 and 
using that r-^{Ti +T2 + T3) tends to zero in probability. □ 

The proof of Theorem 2 follows using Lemma 8. 

4-5.3. Proof of Theorem 2 in the general case 

We give a sketch of the proof of the result in the general case. We have the following 
lemma: 

Lemma 13. If 2-^" + r-^2^"/^{n-'^ + al logn) + 2^"r„ 0, then we have the following 
convergences in stable law, where B is a standard Brownian motion, independent of J-: 

V 2 Jo 
if Pn^P>0, V^C.^^cs-^ f g{X,)g'{Xs)a{X,)^ds 



+ / giXt)^a{Xt)[Afi{Xt)]'/^dBt 



ifPn 



g{X,)g'{X,)aiXs)^ds 
-^j\{XtYa{Xt)ABt. 



Proof. In this case, we have by analogy with Section 4.5.2 

fnix,u,y) = {n/2y/'^g{x-anu)Pn\[u + y/Pn\\- 
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Let 

fnix, u,y) = {n/2y/^gix)PrMu + 2///3„J |. 

We have 

k k 

with E[|Z|] < can- Hence, we easily get the result when /?„ tends to zero. We also have 

+ cj„ fe (^2^- J lj„ k (s) [M/„ (X, ) - Mfn {Xs )] ds^ 

A bias is induced by the second term if /3„ docs not tend to zero. Indeed 

a-'[MMX,) - MfniXs)] « -i.g'(X,)a(X,). 

Since 

2™>"-l 1 



we are able to compensate this bias. We conclude the proof of Theorem 2 using that if 
Un tends to U in probability on and tends to Y in stable law, then (L/„,y„) tends 
to ([/, Y) in stable law. □ 
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